From Appearance to Essence: Comparing Truth Discovery Methods without Using Ground Truth

نویسندگان

  • Xiu Susie Fang
  • Quan Z. Sheng
  • Xianzhi Wang
  • Wei Emma Zhang
  • Anne H. H. Ngu
چکیده

Truth discovery has been widely studied in recent years as a fundamental means for resolving the con icts in multi-source data. Although many truth discovery methods have been proposed based on di erent considerations and intuitions, investigations show that no single method consistently outperforms the others. To select the right truth discovery method for a speci c application scenario, it becomes essential to evaluate and compare the performance of di erent methods. A drawback of current research e orts is that they commonly assume the availability of certain ground truth for the evaluation of methods. However, the ground truth may be very limited or even out-of-reach in practice, rendering the evaluation biased by the small ground truth or even unfeasible. In this paper, we present CompTruthHyp, a general approach for comparing the performance of truth discovery methods without using ground truth. In particular, our approach calculates the probability of observations in a dataset based on the output of di erent methods. The probability is then ranked to re ect the performance of these methods. We review and compare twelve existing truth discovery methods and consider both single-valued and multi-valued objects. Empirical studies on both real-world and synthetic datasets demonstrate the e ectiveness of our approach for comparing truth discovery methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Truth Discovery from Conflicting Multi-Valued Objects

Truth discovery is a fundamental research topic, which aims at identifying the true value(s) of objects of interest given the conflicting multi-sourced data. Although considerable research efforts have been conducted on this topic, we can still point out two significant issues unsolved: i) single-valued assumption, i.e., current methods assume only one true value for each object, while in reali...

متن کامل

The Dialectic of Truth and Appearance in Lacan’s Reading of Gerhard Richter’s Overpainted-from-photographs

One of the most important contemporary theoretical approaches that can be used to analyze works of art is the theories of Jacques Lacan (1901-1980), a French post-structuralist psychoanalyst. By combining psychoanalysis, philosophy, linguistics, and anthropology, he analyzes the world of the human subject’s mind in a series of intertwined and extensive cultural and social relationships.This art...

متن کامل

Exploring Relevance as Truth Criterion on the Web and Classifying Claims in Belief Levels

The Web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the Web. Moreover, different websites often provide conflicting information on a subject. Several truth discovery methods have been proposed for various scenarios, and they have been successfully applied in diverse application domains. In this paper...

متن کامل

Applying a climatologically oriented GIS in comparison of TRMM estimated severe thunderstorm rainfalls with ground truth in Sydney metropolitan area

The main objective of the current research was comparison of severe thunderstorm rainfalls with TRMM data when flash flooding events observed in the Sydney Metropolitan Area (SMA) located in NSW, Australia. Severe Thunderstorm Rainfall Events have been first extracted from the severe storm archive of the Australian BOM, by induction of specific criteria. The corresponded derived dataset includ...

متن کامل

Data-Driven Evaluation of Non-Rigid Registration via Appearance Modelling

This paper presents a generic method for assessing the quality of non-rigid registration (NRR) algorithms, that does not depend on the existence of any ground truth, but depends solely on the data itself. The data is taken to be a set of images. The output of any non-rigid registration of such a set of images is a dense correspondence across the whole set. Given such a dense correspondence, it ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1708.02029  شماره 

صفحات  -

تاریخ انتشار 2017